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Abstract 

The reassembling of a simple connected graph G = {V,E) is an abstraction of a problem arising in earlier 
studies of network analysis. Its simplest formulation is in two steps: 

(1) We cut every edge of G into two halves, thus obtaining a collection of n = \ V\ one-vertex components, 
such that for every v eV the one-vertex component {u} has | degree (v) \ half edges attached to it. 

(2) We splice the two halves of every edge together, not of all the edges at once, but in some ordering 0 of the 
edges that minimizes two measures that depend on the edge-boundary degrees of assembled components. 

A component A is a subset of V and its edge-boundary degree is the number of edges in G with one endpoint in 
A and one endpoint inV - A (which is the same as the number of half edges attached to A after all edges with 
both endpoints in A have been spliced together). The maximum edge-boundary degree encountered during the 
reassembling process is what we call the a-measure of the reassembling, and the sum of all edge-boundary 
degrees is its /3-measure. The a-optimization (resp. /3-optimization) of the reassembling of G is to determine 
an order 0 for splicing the edges that minimizes its a-measure (resp. /3-measure). 

There are different forms of reassembling, depending on restrictions and variations on the ordering 0 of the 
edges. We consider only cases satisfying the condition that if the an edge between disjoint components A and 
B is spliced, then all the edges between A and B are spliced at the same time. In this report, we examine the 
particular case of linear reassembling, which requires that the next edge to be spliced must be adjacent to an 
already spliced edge. We delay other forms of reassembling to follow-up reports. 

We prove that a-optimization of linear reassembling and minimum-cutwidth linear arrangment (CutWidth) are 
polynomially reducible to each other, and that /3-optimization of linear reassembling and minimum-cost linear 
arrangement (MinArr) are polynomially reducible to each other. The known NP-hardness of CutWidth and 
MinArr imply the NP-hardness of a-optimization and /3-optimization. 


1 Introduction 

We start with a gentle presentation of our graph problem and then explain the baekground that motivates our 
examination. 

Problem Statement. Let G = {V,E)he. n simple (no self-loops and no multi-edges), eonneeted, undirected 
graph, with | U | = n ^ 1 vertices and \E \ = m edges. One version of the reassembling of G is edge-directed 
and can be defined by a total order 0 of the m edges of G. Informally and very simply, a total order 0 of the 
edges gives rise to a reassembling of G as follows: 

(1) We cut every edge into two halves, thus obtaining a collection of n disconnected one-vertex components, 
such that for every n e U the one-vertex component {n} has | degree {v) \ half edges attached to it. 

(2) We reconnect the two halves of every edge in the order specified by 0, obfaining larger and larger compo- 
nenfs, until fhe original G is fully reassembled. 

^Partially supported by NSF awards CCF-0820138 and CNS-1135722. 
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To distinguish this reassembling of G according to an order 0 from a later reassembling of G more suitable for 
parallel computation, we call the former sequential reassembling and the latter binary reassembling. 

A bridge is a yet-to-be-reconnected edge between two components, say, A and B, with disjoint sets of 
vertices; we call such components The set of bridges between A and B is denoted d{A,B). For 

technical reasons, when we reconnect one of the bridges in d{A, B), we also reconnect all the other bridges in 
d{A, B) and cross them out from further consideration in 0. Thus, in the reassembling of G according to 0, 
there are at most m steps, rather than exactly m steps. 

In the case when B = V - A, the set d{A, B) is the same as the cut-set of edges determined by the cut 
(A, V - A). Instead of d{A, V - A), we write d{A). The edge-boundary degree of a cluster A is the number 
of bridges with only one endpoint in A, i.e., \ d{A) \. 

Several natural optimization problems can be associated with graph reassembling. Two such optimizations 
are the following, which we identify by the letters a and /3 throughout: 

(a) Minimize the maximum edge-boundary degree encountered during reassembling. 

(/3) Minimize the sum of all edge-boundary degrees encountered during reassembling. 

Initially, before we start reassembling, we always set the a-measure Ma to the maximum of all the vertex 
degrees, i.e., max {degree(v) |n e I/}, and we set the /3-measure to the sum of the vertex degrees, i.e., 
Y, {degree{v) \ v ^V}, regardless of which strategy, i.e., the order 0 of edges, is selected for the reassembling. 
During reassembling, after we merge disjoint nonempty clusters A and B, we update the a-measure Mq, to: 
max {Mci, I d{A u i?) |}, and the /3-measure Mj^ to: (M^ -i- | d{A u B) |). The reassembling process terminates 
when only one cluster remains, which is also the set V of all the vertices. 

In what we call the binary reassembling of G, we reconnect bridges in several non-overlapping pairs 
of clusters simultaneously. That is, at every step - which we may call a parallel step for emphasis - we 
choose k ^ 1 and choose k cluster pairs {Ai,Bi), ..., (A^, B^), where Ai, i3i,..., A^, B^ are 2k pairwise 
disjoint clusters {i.e., with pairwise disjoint subsets of vertices), and simultaneously reconnect all the bridges 
in d{Ai,Bi). The subsets Ai, i?i,..., Ak,Bk may or may not include all of the vertices, i.e., in general 

'i' Bi) £ V rather than = V. (We write “ta” to denote disjoint union.) 

A binary reassembling is naturally viewed as vertex-directed and described by a binary tree B - root at the 
top, leaves at the bottom - with n leaf nodes, one for each of the initial one-vertex clusters. Each non-leaf node 
in 13 is a cluster (A ta i3) obtained by reconnecting all the bridges in d{A, B) between the sibling clusters A 
and B. The first parallel step, i = 0, starts at the bottom in the reassembling process, by considering the n leaf 
nodes of B and calculating the max (for a optimization) or the sum (for /3 optimization) of all vertex degrees. 
If h is the height of B, the last parallel step is z = /i, which corresponds to the root node of B (the entire set V 
of vertices) and produces the final a-measure and /3-measure. Clearly, [logn] ^ h ^ n - 1. 

Every sequential reassembling of G can be viewed as a binary reassembling of G where, at every step, only 
one cluster pair (A, B) is selected and one nonempty set of bridges d{A, B) is reconnected. Conversely, by 
serializing (or sequencializing) parallel steps, every binary reassembling which we call strict can be re-defined 
as a sequenfial reassembling. Defails of fhe correspondence befween sequential and binary reassemblings are 
in Appendix [A| 

A binary reassembling is strict if fhe merging of a cluster pair (A, B) is resfricfed fo fhe case d{A, B) ^ 0. 
If an a-opfimal (resp. /3-opfimal) binary reassembling is sfricf, fhen ifs serializafion is an a-opfimal (resp. /3- 
opfimal) sequenfial reassembling. 

The Linear Case. A possible and nafural variafion (or resfricfion) of graph reassembling is one which we call 
linear. If, af every sfep of fhe reassembling process, we require fhaf fhe cluster pair (A, 23) fo be merged is such 
thaf one of fhe fwo clusters, A or 23 (or bofh af fhe firsf sfep), is a singleton sef, fhen fhe resulfing reassembling is 

'These terms (bridge, cluster, and others, later in this report) are overloaded in graph-theoretical problems. We make our own use 
of these terms, and state it explicitly when our meaning is somewhat at variance with that elsewhere in the literature. 
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linear. The binary tree B describing a linear reassembling of a graph G with re vertices is therefore a degenerate 
tree of height h = n-1. 

Clearly, there can be no non-trivial parallel step which merges two (or more) cluster pairs in linear reassem¬ 
bling, i.e., no step which simultaneously merges disjoint cluster pairs and (^ 2 , - 82 ) to form the non¬ 

singleton clusters {Ai&Bi) and {A 2 'i>B 2 ), before they are merged to form the cluster {^{Ai&Bi)&{A 2 ^B 2 )\ 
Another useful way of understanding a linear reassembling of graph G is in its sequential formulation 
(when the reassembling is strict): The order 0 for reconnecting the edges is such that the next edge to be 
reconnected is always adjacent to an already re-reconnected edge, which enforces the requirement that the next 
cluster pair {A, B) to be merged is always such that A or i? is a singleton set. 

There are other natural variations of graph reassembling, such as balanced reassembling, whose binary 
tree B maximizes the merging of cluster pairs at every parallel step and whose height h is therefore [ log re]. We 
study these in follow-up reports. 

Background and Motivation. Besides questions of optimization and the variations which it naturally sug¬ 
gests, graph reassembling (and the related operation of graph assembling, not considered in this paper) is part 
of the execution by programs in a domain-specific language (DSL) for the design of flow networks [[2ir71[^[T5l. 
In network reassembling, the network is taken apart and reassembled in an order determined by the designer; 
in network assembling, the order in which components are put together is pre-determined, which is the order in 
which components become available to the designer. 

A flow network is a directed graph where vertices and edges are assigned various attributes that regulate 
flow through the network]^ Programs for flow-network design are meant to connect network components in 
such a way that typings at their interfaces, i.e., formally specified properties at their common boundaries, are 
satisfied. Network typings guarantee there are no conflicting data types when different components are con¬ 
nected, and insure that desirable properties of safe and secure operation are not violated by these connections, 
i.e., they are invariant properties of the whole network construction. 

A typing r for a network component A (or cluster A in this report’s terminology) formally expresses a 
constraining relationship between the variables denoting the outer ports of A (or the edge-boundary d{A) in 
this report). The smaller the set of outer ports of A is, the easier it is to formulate the typing r and to test 
whether it is compatible with the typing t' of another network component A'. Although every outer port of A 
is directed, as input port or output port, the complexity of the formulation of r depends only on the number of 
outer ports (or | d{A) \ in this report), not on their directions. 

If /c is a uniform upper bound on the number of outer ports of all network components, the time complexity 
of reassembling the network without violating any component typing r can be made linear in the size re of the 
completed network and exponential in the bound k - not counting the preprocessing time /(re) to determine 
an appropriate reassembling order. Hence, the smaller are k and /(re), the more efficient is the construction of 
the entire network. From this follows the importance of minimizing the preprocessing time /(re) for finding a 
reassembling strategy that also minimizes the bound k. 

Main Results. In this report, we restrict attention to the linear case of graph reassembling, which is interesting 
and natural in its own right. We first prove that a-optimization and /3-optimization of linear reassembling are 
both NP-hard problems. We obtain these results by showing that: 

• a-optimization of linear reassembling and minimum-cutwidth linear arrangment (CutWidth) 
are polynomially-reducible to each other, 

• /3-optimization of linear reassembling and minimum-cost linear arrangement (MinArr) 
are polynomially-reducible to each other. 

^Such networks are typically more complex than the capacited directed graphs that algorithms for max-flow (and other related 
quantities) and its generalizations (e.g., multicommodity max-flow) operate on. 
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Both CutWidth and MinArr have been extensively studied: They are both NP-hard in general, but, in a few non¬ 
trivial special cases, amenable to low-degree polynomial-time solutions IIT3ll . By our polynomial-reducibility 
results, these polynomial-time solutions are transferable to the a-optimization and /3-optimization of linear 
reassembling. This leaves open the problem of identifying classes of graphs, whether of practical or theoretical 
significance, for which there are low-degree polynomial-time solutions for our two optimization problems. 

Organization of the Report. In Section we give precise formal definitions of several notions underlying 
our entire examination. The formulation of some of these (e.g., our definition of binary trees) is not standard, 
but which we purposely choose in order to facilitate the subsequent analysis. 

In Sectionwe give several examples to illustrate notions discussed in this Introduction, in Section]^ as 
well as in later sections. 

In Sections|^and[^we prove our NP-hardness results about a-optimization and /3-optimization. Some of 
the long technical proofs for these sections are delayed to Appendix]^ 

In the final shorf Secfion we poinf fo open problems and fo furfher currenf research on a-opfimizafion 
and /3-opfimizafion of graph reassembling. 

2 Formal Definitions and Preliminary Lemmas 

Lef G = {V,E) be a. simple (no self-loops and no mulfi-edges) undirecfed graph, wifh \ V\ = n ^ 1 and 
\ E \ = m ^ 1. Throughouf, fhere is no loss of generality in assuming fhaf G is connecfed. 

As pointed ouf in Secfion [TJ fhere are fwo disfincf definitions of fhe reassembling of G. The firsf is 
easier fo sfafe informally: This is fhe definifion of sequential reassembling. The second definifion, binary 
reassembling, is more convenienf for fhe opfimizafion problems we wanf fo sfudy. The analysis in Ibis report 
and follow-up reporfs is based on fhe formal definition of binary reassembling and ifs variafions; we delay a 
formal definifion of sequenfial reassembling fo Appendix]^ where we also skefch fhe proof of fhe equivalence 
of fhe fwo definifions when reassembling is strict (see Definifion]^. 

2.1 Binary Graph-Reassembling 

Our notion of a binary reassembling of graph G presupposes fhe nofion of a binary tree over a finife sef 
V = {fi,..., Un}. Our definifion of binary trees is nof sfandard, buf is more convenienf for our purposes]^ 

Definition 1 {Binary trees). An (unordered) binary tree B over V = {r;i,..., Vn} is a collecfion of non-empty 
subsefs of V safisfying fhree condifions: 

1. For every v €V, fhe singleton sef {?;} is in B. 

2. The full sef V is in B. 

3. For every X € B, fhere is a unique Y € B such fhaf: X nY = 0 and (X uY) € B. 

The leaf nodes of B are fhe singleton sefs {u}, and fhe root node of B is fhe full sef V. Depending on fhe 

confexf, we may refer fo fhe members of B as ifs nodes or as ifs clusters. Several expecfed properfies of B, 

reproducing familiar ones of a sfandard definifion, are sfafed in fhe nexf fwo proposifions. □ 

Proposition 2 (Properfies of binary frees). Let B be a binary tree as in Definition^ let v €V, and let: 

(t) {^;} = Xo ^ Xi ^X2 ^ ^ Xp = V 

standard definition of a binary tree T makes T a subset of the set of finite binary strings {0,1}* * such that: 

• T is prefix-closed, i.e., iftsT and u is a prefix of t, then u eT. 

• Every node has two children, i.e., f 0 e T iff f 1 s T for every t s {0,1}*. 

The root node of T is the empty string e, and a leaf node of T is a string tsT without children, i.e ., both tO and 11 / T. 
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be a maximal sequence of nested clusters from B. We then have: 

1. The sequence in Q is uniquely defined, i.e., every maximal nested sequence starting from {?;} is the same. 

2. For every cluster Y € B, if{v} £ Y, then Y e {Xq, ..., ^p}- 

3. There are pairwise disjoint clusters {Yq, ..., l^-i} £ B such that, for every 0 < p: 

Xi nYi = 0 and Xi uYi = Xj+i. 

Based on this proposition, we use the following terminology: 

• We call the sequence in ([t]l, which is unique by part 1, the path from the leaf node {u} to the root node V. 

• Every cluster containing v is one of the nodes along this unique path, according to part 2. 

• In part 3, we call Yi the unique sibling of Xi, whose unique common parent is Xi+i, for every 0 < p. 

Proof Part 1 is a consequence of the third condition in Definition [T] which prevents any cluster/node in B from 
having two distinct parents. 

For part 2, consider the nested sequence {u} £ y c f/, and extend it to a maximal nested sequence, which 
is uniquely defined by part 1. This implies Y is one of the nodes along the path from {u} to the root V. 

Part 3 is another consequence of the third condition in Definition [T] For every 0 ^ z < p, y* is the unique 
cluster in B such that j n = 0 and XiUYi € B. Remaining details omitted. □ 

Proposition 3 (Properties of binary trees). Let B be a binary tree as in Definition^ We then have: 

1. For all clusters X,Y ^ B, if X n y ^ 0 then X ^Y orY ^ X. 

2. For every cluster X € B, the sub-collection of clusters Bx ■= { ^ ^ B\Y ^ X} is a binary tree over X, 
with root node X. 

3. B is a collection of {2n- 1) clusters. 

Proof. For part 1, lety = yny t 0 and consider the maximal sequence of nested clusters which extends 
ZcXcyorZcycy. By part 1 of Proposition]^ both X and Y must occur along this nested sequence. 

For part 2, we directly check that Bx satisfies the three defining properties of a binary tree. Straightforward 
details omitted. 

Part 3 is proved by induction on n ^ 1. For the induction hypothesis, we assume the statement is true for 
every binary tree with less than n leaf nodes, and we then prove that the statement is true for an arbitrary binary 
tree over V with | F | = n. Consider a largest cluster X € B such that X tV. There is a unique Y € B such that 
y n y = 0 and X uY € B. Because X is largest in size but smaller than V, it must be that X u y = V, which 
implies that {X, y} is a two-block partition of V. Consider now the subtrees Bx and By, apply the induction 
hypothesis to each, and draw th desired conclusion. □ 

A common measure for a standard definition of binary trees is height, which becomes for our notion of 
binary trees in Definition [T] 

height{B) ■= maxjp | there is u e F such that 

{u} = Xq ^ Xi ^ ^ Xp = F is a maximal sequence of nested clusters }. 

For a particular node/cluster X € B, the subtree of B rooted at X is Bx, by part 2 in Proposition]^ The height 
of X in 13 is therefore heightQ{X) := height {Bx)- 

If X, y c F are disjoint sets of vertices, d^X, Y) is the subset of edges of G with one endpoint in each 
of X and y. If y = F - X, we write dciX) instead of dciX, V - X). 
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Definition 4 {Binary reassembling). A binary reassembling of the graph G = {V,E) is simply defined by a 
pair {G,B) where 13 is a binary tree over V, as in Definition 

Given a binary reassembling (G,B) of G, two measures are of particular interest for our later analysis, 
namely, for every cluster X € B, the degree of X and the height of X : 

degreeQj^{X) := |5g(X)| and heightQj^{X) := height^{X). 

If the context makes clear the binary reassembling {G,B) - respectively, the binary tree B - relative to which 
these measures are defined, we write degree{J0 and height{X) - respectively, degreeQ{X) and heightQ{X) 
- instead of degree q q{X) and height q q{X)^ □ 

Definition 5 {Strict binary reassembling). We say the binary reassembling (G, B) in Definitionis strict if it 
satisfies the following condition: For all clusters X,Y e 13, if X and Y are sibling nodes, then dciX, Y) t 0 . 
(In Definition|^ when we merge sibling clusters X,Y € B, we do not require that dciX, Y) t 0 .) □ 

2.2 Optimization Problems 

The following definition repeats a definition in Section [T] more formally. 

Definition 6 {Measures on the reassembling of a graph). Let {G, B) be a binary reassembling of G. We define 
the measures a and /3 on (G, B) as follows: 

a{G,B) ■= max. [degreeQQ{X) | W e 13}, 

/3{G,B):=Y. {degreea,i3iX)\X€B}. 

An optimization problem arises with the minimization of each of these measures. For example, the optimal 
a-measure of graph G is: 

q;(G) = min { a(G, B) | (G, B) is a binary reassembling of G }. 

We say the binary reassembling {G,B) is a-optimal iff a{G,B) = a{G). We leave to the reader the obvious 
similar definition of what it means for the binary reassembling (G, B) to be /3-optimal. □ 

2.3 Linear Graph-Reassembling 

Let G = {V, E) be a simple undirected graph with \ V\ = n. We say the binary reassembling (G, B) is linear if 
13 is a linear binary tree over V, i.e., all the clusters of size ^ 2 forms a single nested chain of length (n - 1): 

(A) ^ ^ X„_i = y 

This implies the height of 13 is (n - 1). By part 3 in Proposition]^ there are n leaf nodes/singleton clusters 
{Yb) • • •) Y'n-i} £ B such that = lb u and for every 1 ^ 1 ^ re - 2 : 

(B) n Yi+i = 0 and Xi u Yi+i = Xi+i. 

We use the letter L to denote a linear binary tree, and write {G,C) to denote a linear reassembling of G. 

In Definition [7| we mostly use the notation and conventions of llT3ll and the references therein. We write 
Ww to denote the edge connecting vertex v and vertex w. 

Our binary reassembling of G can be viewed as the “hierarchical clustering” of G, similar to a method of analysis in data mining, 
though used for a different purpose. Our binary reassembling mimicks what is called “agglomerative, or bottom-up, hierarchical 
clustering” in data mining. 
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Definition 7 {Linear arrangements and cutwidths). A linear arrangement ip of the graph G = {V,E), where 
I i/ I = n, is a bijection ip '-V We refer to this linear arrangement by writing {G,ip). 

Following |[T3l, given linear arrangement (G, (p) and i e { 1 ,..., n}, we define a two-block partition of the 
vertices, V = L{G, ip, i) ta Ii(G, ip, i), where: 

L{G, ip,i) '■= {v \ ip{v) ^i} and R{G, ip,i) '■= {w € V \ ip{w) > i}. 

The edge cut at position i, denoted C{G, ip, i), is the number of edges connecting L{G, ip, i) and R{G, ip, i): 

C{G,ip,i) :=|{ vw € E \ v € L{G,ip,i) and w e R{G,ip,i) } |. 

In our notation in Definition!^ we have: 

C{G,ip,i) = \d{L{G,ip,i),R{G,ip,i)) \ = degree {L{G, ip, i)). 

The length ofWw in the linear arrangement {G,ip), denoted ^{G,ip,Ww), is “1 -i- the number of vertices 
between v and w”'. 

i{G,ip,vw) ■■= \ ip{v)-ip{w)\. 

The a-measure and /3-measure of the linear arrangement {G, ip) are defined by: 

a{G,ip) ;= max{ C(G, (/?, z) I 1 ^ f ^ n } = degree (^L{G, ip, i)) \ 1 ^ i ^ n^, 

P{G,p):=Y, {C{G,p,i)\l^i^n} = Y, {degree{L{G,p,i)) \l ^ i ^ n}. 

In the literature, a{G, ip) is called the cutwidth of the linear arrangement {G, p). The cost of the linear arrange¬ 
ment {G, p) is usually defined as the total length of all the edges relative to {G, p), i.e., the cost is the measure 
7 (G, p) given by: 

l{G,p) := Y{^^G,p,vw)\vw ^ E). 

However, by Lemmaj^below, j5{G,p) is equal to 7 (G, p). □ 

Lemma 8. For every linear arrangement {G, p), we have (3{G, p) = 'y{G, p). 

Proof. We have to prove that 

^{C(G,(/?,z) I = Y{^^G,p,vw)\vw ^ E). 

This equality holds whether G is connected or not. So, a formal proof (omitted) can be written by induction on 
the number m ^ 0 of edges in G. But informally, for every edge Ww e E, if p{v) = i and p{w) = j with i < j, 
then its length ^{G,p, vw) = j - i. In this case, the length of edge vw contributes one unit to each of (j - z) 
consecutive edge cuts: C{G, p,i),..., ({G, p,j - 1). Hence, if we delete edge vw from the graph, we decrease 
the two quantities: 

^ { C(G, z) I 1 ^ z ^ rz } and Y{C{G,p,vlU)\¥w € E }. 

by exactly the same amount (j - z). The desired conclusion follows. □ 

Definition 9 {Optimal linear arrangements). Let G = {V,E) be a simple undirected graph. We say the linear 
arrangement {G, p) is a-optimal if: 

a{G,p) = min I a(G, 7 )^) I (G, is a linear arrangement}. 

The a-optimal linear arrangement problem is the problem of defining a bijection p --V ^ {1,... ,n} such that 
(G, p) is a-optimal. We define similarly the jd-optimal linear arrangement problem. □ 


7 


Definition 10 {Linear arrangement induced by linear reassembling). Let G = {V,E) he, a. simple undirected 
graph and (G, C) be a linear reassembling of G. Using the notation in (|^ and Q in the opening paragraph of 
Section [23] the n leaf nodes (or singleton clusters) of £ are: Yo,Yi,Y 2 ,..., Un-i- Observe that the order of the 
singletons Y 2 ,, Yn-i and, therefore, the order of the (n - 2) vertices in >2 ■ ■ ■ u Yn-i is uniquely determined 
by the chain in (|^, but this is not the case for the order in which we write Yq and Yi, i.e., the first cluster Xi 
in is equal to both Yq u Yi and Yi u Uq- 

We want to extract a linear arrangement (G, (p) from the linear reassembling (G, C). This is achieved by 
defining ip -V ->• {1,..., n} as follows: 

• Lef Yq = {u} and Yi = {r;'}. 

- If degree{v) ^ degree{v') fhen sef <p{v) := 1 and ip>{v') ■= 2. 

- If degree{v) ^ degree{v') fhen sef ip{v') := 1 and ip{v) := 2. 

• For every 2^f^n-l, ifli = {?;} fhen sef <p{v) ■= i + 1. 

If is possible fhaf degree{v) = degree{v’), in which case tp may place v firsl and v' second, or v' firsl and v 
second. This ambiguify is harmless for our analysis, in fhaf if does nol affecl fhe a-measure and fhe /3-measure 
of fhe linear arrangemenf {G,p). 

Whefher p places v firsl and v' second, or v' firsl and v second, we call {G,p) a linear arrangemenf of G 
induced by fhe linear reassembling (G, £). 

Note fhaf, for every 1 ^ z ^ n - 1, we have fhe equably Xi = L{G, p, i), where L(G, p, i) is fhe sef of 
verfices al posifion i and lo fhe lefl of if, as in Definilion|7] □ 

Definition 11 {Linear reassembling induced by linear arrangement). Lef G = {V,E) be a simple undirected 
graph and lef (G, p) be a linear arrangemenf of G. We exlracl a linear reassembling (G, C) from (G, p). The 
n leaf nodes/singlefon clusters of £ are: 

{ {u} I z; e U} = { {p~^{i)} | 1 ^ i < n}. 

The (n - 1) non-leaf nodes/clusters of £ are: 


X, := 

1 

1—1 

1 


X 2 := 

£-'(2), £-'(3)}, 


:= 

{£-^1), £-'(2), £-'(3), ... 

, £ \n)} = V. 


We call (G, £) fhe linear reassembling of G induced by fhe linear arrangemenf (G, p). □ 

3 Examples 

We presenl several simple examples lo illuslrafe nofions menlioned in Secfions [T] and We firsl inlroduce a 
convenienl nolafion for specifying binary frees over a sef V of verfices. Lef denole fhe power sef of V. We 
define a binary operalion X y for all /F c 2 '^ and 3^ £ 2 ^ by: 

^ := (Xuy) u ((U‘^)u(U3^)) 

The examples below illuslrafe how we use Ihis operalion “ ”. 

We overload fhe overline nolafion “ ” fo denote a nonempty subsel of V, i.e., vi---Vk means {ui,..., v^}. 
In particular, fhe edge connecting v and w is denoled by fhe Iwo-vertex sef Ww = {u, w}. 

The confexl will make clear whefher we use “ ” lo refer lo a sef of subsels of V (in fhe case of binary 
frees) or fo jusl a subsel of V. 





Example 12. The hypercube graph is shown on the left in Figure[T] and three of its reassemblings are shown 
on the right in Figure [T] The top reassembling is neither balanced nor linear; the middle one is balanced', and 
the bottom one is linear. 



Figure 1: Two different drawings of the hypercube graph Q-^ (on the left) and three of its reassemblings (on the right). 


For each of the three reassemblings on the right in Figure [TJ from top to bottom, we list the unique 
binary tree B over the vertices {1,..., 8} underlying it (in its binary reassembling formulation) and one of the 
orderings 0 of the edges {1 2,..., 78} inducing it (in its sequential reassembling formulation): 


= 1 1234 

5 6 7 8 

II 

CO 

( 

0 

1 2 

34 

56 

78 

1 3 

35 

57 •• 


S 2 = { 12 3 4 

r - 

5 6 7 8 

II 

CO 

0 

1 2 

34 

56 

78 

1 3 

5 7 

35 ■■ 

■ (balanced) 

1 12 3 4 

5 6 7 8 

0<33 _ 

1 2 

1 3 

34 

3 5 

5 6 

5 7 

78 ■■ 

■ (linear) 


where, for simplicity, we write just “u” instead of the cumbersome {{u}} = v . Thus, for example, two of the 
subsets of Bi above appear as “ 1 2 ” and “ 1 2 3 4 ”, and if we expand them in full, we obtain: 

Ty={{l},{2},{l,2}} and TT IT = {{1}, {2}, {3}, {4}, {1,2}, {3,4}, {1,2,3,4}}. 

The ellipsis in the definition of 0^ above are the remaining edges of which can be listed in any 
order without changing the reassemblingjj A simple calculation of the a-measure and /3-measure of these three 
reassemblings of produces: 

a{Q'i,Bi) = a{Q 3 ,B 2 ) = 4 and a{Q‘i,B‘i) = 5, 
mM = = 48 and = 49. 

By exhaustive inspection (details omitted), (Qs, i3i) is both a-optimal and /3-optimal for the class of all binary 
reassemblings of Qa- Because the a-measure and /3-measure of (Q 3 ,Bi) and those of {Q 3 ,B 2 ) are equal, 

^ We qualify with the superscript “Qa” because it depends on the graph Q 3 . The same ordering of the edges may not be valid 
for a sequential reassembling of another 8-vertex graph with a set of edges different from that of Q 3 . This is not the case for the binary 
tree B underlying the binary reassembling (G, B) of a graph G = {V, E) \ that is, regardless of the placement of edges in G, the tree B 
over V is valid for the binary reassembling (G, B) and again for the binary reassembling {G', B) of every graph G' = {V, E') over the 
same set V of vertices. 
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{Qz-,^ 2 ) is also both a-optimal and /3-optimal for the class of all binary reassemblings and, therefore, for the 
smaller class of all balanced reassemblings of which it is a member. By exhaustive inspection again, 
is both a-optimal and /3-optimal for the sub-class of all linear reassemblings, but not for the full class of all 
binary reassemblings of Q 3 . □ 

Example 13 . The complete graph iTg on 8 vertices is shown in Figure We can carry out three reassemblings 
of iTg by using the binary trees 81,82, and 83, from Example [T^ again. 

A straightforward calculation of the a-measure and /3-measure of the 
resulting reassemblings (Kg,81), {Kg, 82), and {Kg, 83) produces 
the following values: 

a{Kg,8i) = a{Kg,82) = a{Kg,83) = 16, 

( 3 {Kg, 8 i)= 132, P{Kg,82)= 136, P{Kg,83) = 133. 

Because of the symmetries of Kg (“every permutation of the 8 
vertices produces another graph isomorphic to Kg'), all balanced 
reassemblings are isomorphic and so are all linear reassemblings. 

Hence, {Kg, 82) is trivially a-optimal and /3-optimal for the class of 
all balanced reassemblings of Kg, and {Kg, 83) is trivially a-optimal 
and /3-optimal for the class of all linear reassemblings of Kg. 

By exhaustive inspection (details omitted), it turns out that {Kg, 81 ) is a-optimal for the class of all binary 
reassemblings, but it is not /3-optimal for the same class. The underlying tree of a /3-optimal binary reassembling 
of Kg turns out to be the following 8 a over the vertices {1,..., 8 }, shown with an ordering 0^® of the edges 
which induces a sequential ordering equal to {Kg, 84): 

84 = I TT TT TT 7 8 ^ ^ ^ ^ ^ ^ ■■■ 

where the ellipsis “...” are the remaining edges in any order. The resulting /3-measure is l 3 {Kg, 84) = 127. □ 

Example 14 . The star graph Sj, with 7 leaves and one internal vertex, is shown in Figure We can carry out 
four reassemblings of S^ by using the binary trees 84,82,83, and 84, from Examples [T^ and [T3] again. 

From a straightforward calculation of the a-measure and /3-measure 
of the reassemblings {Sj, 84), {87,82), {87,83), and {87,84): 

a{ 87 , 84 ) = a{ 87 , 82 ) = a{ 87 ,B 3 ) = a{ 87 ,B 4 ) = 7 , 

P{ 87 ,B 4 )= 32 , /3(57,H2)= 34, 

33 ( 57 ,^ 3 )= 35, 33(57,^4)= 31. 

The preceding four reassemblings are all a-optimal, each for its own 
class of reassemblings, i.e., {87,82) is a-optimal for the class of 
balanced reassemblings of 87 and {87,83) for the class of linear 
reassemblings of 87. It turns out that only {87,82) is /3-optimal for 
its own class, the class of balanced reassemblings of 87. None of the 
four is /3-optimal for the class of all binary reassemblings of 87. 

Of the four binary reassemblings above, only {87,83) is strict, the three other are not strict, i.e., the three 
other merge some cluster pairs {A, B) such that d{A, B) = 0. Because {87,84), {87,82), and {87,84) are 
not strict, it is not possible to re-define them as sequential reassemblings, each relative to an appropriate edge 
ordering. An ordering of the edges that induces a sequential reassembling equal to {87,83) is: 

ef = TT TT TT TT TT TT TT. 
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By exhaustive inspection (details omitted), the following is a binary tree over the vertices { 1 ,... ,8} such 
that (S'7, B^) is / 3 -optimal for the class of all binary reassemblings of St. Since (S'7, B^) is not strict, there is 
no corresponding ordering of the edges: 


B^ = 


2 3 4 1 5 6 7 8 


The resulting /3-measure is f5{ST,B^) = 29. Note that B^ is also linear. Hence, {Sj^B^) is also /3-optimal for 
the class of linear reassemblings of ^ 7 . □ 

Example 15. The binary tree H 3 in Examples \V2\ [T3j and[^ is a linear binary tree. Written in full, using the 
notation in at the beginning of Section [23] the non-singleton sets of H 3 are: 

= TT, X 2 = TT^, X 3 = 1 234, X 4 = 1 2345 , 

X 5 = 1 23456, Xe = 1 2 345 6 7, X-j = 1 2345 6 78. 

The singleton sets of B^ are: 

yo = 1 , n = 2, y 2 = 3 , y3 = 4, y4 = 5 , yg = 6, ye = 7 , y7 = 8. 

The linear arrangement Lp^ induced by the linear reassembling (^ 7 , 133 ) in Example [T4| is (see Definition [lO]): 

(/P3(2) = 1, <^ 3 ( 1 ) = 2, ¥^ 3 ( 3 ) = 3, ¥^ 3 ( 4 ) = 4, ^ 3 ( 5 ) = 5, (^ 3 ( 6 ) = 6 , ¥^ 3 ( 7 ) = 7, ^ 3 ( 8 ) = 8 , 

rather than the linear arrangement (p'\ 

(/.'(!) = 1, <^:j(2) = 2, (^:j( 3) = 3, ¥^^(4) = 4, = 5, (^^^(fi) = 6, ¥^^(7) = 7, <^:j(8) = 8, 

because degree^.^{2) = 1 < 7 = degree The difference between ip^ and ip'^ is in the placement of the two 
first vertices: vertex “1” and vertex “2”. □ 


Example 16. The binary tree B^, in Example [T4| is a linear binary tree. As in Example [TSj it is straightforward 
to specify the singleton and non-singleton sets of B5 (omitted here) to fit the notation of (|^ and (|^ at the 
beginning of Section There are two possible linear arrangements, ip^ and (p'^, which are induced by the 
linear reassembling {St,B^), because degree^.^{ 2 ) = degree= 1 (see Definition 101 , namely: 


7^5(2) = 1, 
7^U3) = 1, 


<7’5(3) = 2, 

<^ 5 ( 2 ) = 2 , 


<7>5(4) = 3, 

7’U4) = 3, 


7^5(1) = 4 , 
7^5(1) = 4 , 


7^5(5) = 5 , 
T^^S) = 5 , 


7^5(6) = 6, 
7^U6) = 6, 


7^5(7) = 7 , 
7^5(7) = 7 , 


795(8) = 8, 

7^^(8) = 8. 


The difference between ip^ and yjg is in the placement of the two first vertices: vertex “ 2 ” and vertex “ 3 ”. In 
contrast to (ps and in Example both p^ and p’^ are valid as linear arrangements induced by the linear 
reassembling (S'7, H5). A comparison between p-^ and <^95 is shown in Eigure|^ □ 


Example 17. This is a continuation of Example [TS] The linear reassembling induced by the linear arrangement 
(S'7, (^3) is precisely (Sj^Bs), but so is the linear reassembling induced by the linear arrangement {S7,p'^) 
again the same {St,Bs), according to Definition[^ This means: linear arrangements make distinction that 
linear reassemblings do not make. This difference is in the placement of the two first vertices, specifically: 

• The a-measure and / 3 -measure of a linear arrangement generally depend on which vertex is placed first 
and which is placed second. 

• The a-measure and / 3 -measure of a linear reassembling do not distinguish between a first and second 
vertex and do not depend on which is placed first and which is placed second. 
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Figure 4: Comparison of linear arrangements {S 7 , in Example [l^ and (Sy, in Example [T^ 


As an example, consider the linear arrangements (Sy, (/Ja) and (Sy, </? 3 ). Their a-measure and /3-measure are: 

=6, /3(5y,<y93) = 22, 

= 7, /3(5y,<y93) = 28. 

For the linear reassembling { 8 ^, 83 ) induced by both { 87 , (ps) and { 87 , p'^), we have: a{ 87 ,B^) = 7 and 
( 5 ( 87 , 83 ) = 35, as noted in Example [l4| Moreover, while both ( 87 ,^ 3 ) and { 87 ,(f'^) are neither a-optimal 
nor /3-optimal, ( 87 ,83) is a-optimal (though not /3-optimal). □ 

4 ct-Optimization of Linear Reassembling Is NP-Hard 

We prove that the a-optimality of linear arrangements (in the literature: the minimum-cutwidth linear arrange¬ 
ment problem) and the a-optimality of linear reassemblings are reducible to each other in polynomial time. 

Definition 18 {Chordal graph, triangulation, clique number, treewidth). Let G = (V, E) be a simple undirected 
graph. The following are standard notions of graph theory [5l. 

• G is a chordal graph if every cycle of length of 4 or more has a chord, i. e. , an edge connecting two vertices 
that are not consecutive in the cycle. 

• A triangulation of G is a chordal graph G' = (V',E') where V = V' and E E'. In such a case, we say 
that G can be triangulated into G', not uniquely in general. 

• The clique number of G, denoted oj(G), is the size of a largest clique in G. 

• There are different equivalent definitions of the treewidth. We here use a definition, or a consequence of 
the original definition, which is more convenient for our purposes |l3l|5l. The treewidth of G is: 

min { a;(G^) I GMs a triangulation of G } - 1. 

In words, among all triangulations G' of G, we choose a G' whose clique number is smallest: The treewidth of 
G is one less than the clique number of such a G'. □ 

Lemma 19 . For every positive integers A and k, there is an algorithm A using A and k as fixed parameters, 
such that, given an arbitrary simple undirected graph G = (V,E) as input to A if 

1. the maximum vertex degree in G is ^ A, and 

2. the treewidth of G is ^ k. 
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then A computes a minimum-cutwidth linear arrangement ofG in time where n=\V\. 

Proof. This is Theorem 4.2 in ifT^ . where the algorithm not only computes the value of a minimum cutwidth, 
but can be adjusted to output the corresponding minimum-cutwidth linear arrangement p --V ^ {1,..., n). □ 

In Lemma [20l a cut vertex in G is a vertex whose removal increases the number of connected components. 

Lemma 20 . There is an algorithm A such that, given an arbitrary simple undirected graph G = {V,E) as 
input to A if' 

1. every vertex in G has degree ^ 3, and 

2. every vertex in G of degree = 3 is a cut vertex, 

then A computes a minimum-cutwidth linear arrangement of G in time where n=\V\. 

Proof We show that the treewidth A: of G is ^ 2. Because the maximum vertex degree A of G is ^ 3, Lemmafl^ 
implies the existence of an algorithm A which runs in time ) = o( n^^). 

To show that k ^ 2, consider a vertex v of degree = 3, which is therefore a cut vertex. The removal of v 
can have one of two possible outcomes: 

(a) disconnect G into 3 components, or 

(b) disconnect G into 2 components. 

If every vertex of degree = 3 satisfies condition (a), then G is tree whose treewidth is 1, since its clique number 
a;(G) = 2 in this case. 

If Gi and G 2 are cycles in G, each with 3 vertices or more, then Gi and G 2 are non-overlapping, i.e., Gi 
and G 2 have no vertex in common and no edge in common. If they have an edge WpUJi in common, then there 
is an edge vfwi e Gi n G 2 such that degree{v 2 ) = 3 (or, resp., degree{w 2 ) = 3) and V 2 (or, resp., W 2 ) is not a cut 
vertex, contradicting the hypothesis. If Gi and G 2 have no edge in common, but do have a vertex v in common, 
then degree{v 2 ) > 3, again contradicting the hypothesis. 

In case one or more vertices satisfy condition (b), G can be therefore viewed as a finite collection of 
non-overlapping rings {i?i,..., i?p}, each ring being a cycle with at least 3 vertices, satisfying condition (c): 

(c) if two distinct rings {Ri,Rj}, with i 4^ j, are connected by a path Pij, then the removal of all the vertices 
and edges of Pij (in particular the two endpoints of Pij, one in Ri and one in Rj, which are necessarily 
vetices of degree = 3) disconnects G into 2 components. 

Another way of expressing (c) is that, if all the rings {Ri,..., Rp} are each contracted to a single vertex, then 
the result is a tree (where some of the internal vertices may now have degree larger than 3). Since the clique 
number of a ring is 3, the treewidth of a ring is 2, and the desired conclusion follows. □ 

Lemma 21 . Let G = {V, E) be a simple undirected graph, where every vertex has degree ^ 3, and let (G, C) 
be a linear reassembling ofG. Consider the longest chain of nested clusters of size ^ 2, as in @ in the opening 
paragraph of Section \2.3\ 


Xi ^ A2 ^ ^ = K 

Conclusion: If there is one vertex of degree = 3 in G which is not a cut vertex, then 
max { degree(Ai),..., degree(An-i)} ^ 3. 

Proof We first show there are least two vertices of degree = 3 which are not cut vertices. Let n be a vertex 
of degree = 3 which is not a cut vertex, and let {v~x, vy, Wz} be the three edges incident to v. Because v is 
not a cut vertex, any two edges in {vx, Wy, Wz} are consecutive edges in a cycle containing v. Let G(v, x, y) 
be a cycle containing edges {Wx,vy}, and define similarly cycles G{v,x,z) and G{v,y,z). If any of fhese 
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three cycles contains a chord, then the two endpoints of the chord are vertices of degree = 3 which are not cut 
vertices. If none of these three cycles contain a chord, then we can combine any two of them, because they 
share an edge, to form another cycle with a chord, which again implies the existence of two vertices of degree 
= 3 which are not cut vertices. 

To conclude the proof, consider the clusters of £ of size ^ 2: These are {Xi,... and the corre¬ 

sponding singleton clusters are {Iq) • • •) Yn-i}, as in (|^ and (|^ in Section 2.3 By the preceding argument, 
there are at least two vertices of degree = 3 which are not cut vertices. Let one of these two be v, with Yi = {v) 
for some i ^ 1. 

We have n Yj = 0 and u = X^. There are two cases: (1) For some vertex w e Xj-i, there is 
an edge Ww e E, and (2) for every vertex w e Xj-i, there is no such edge. We consider case (1) and leave the 
other (easier) case (2) to the reader. 

We cannot have degree{Xi-i) = 0, otherwise G is disconnected, nor can we have degree{Xi-i) = 1, 
otherwise n is a cut vertex. Hence, degree{Xi-i) ^ 2. If degree{Xi-i) ^ 3, this is already the conclusion of the 
lemma and there is nothing else to prove. Suppose degree(Xi^i) = 2, the case left to consider. 

Similarly, we cannot have degree{Xi) = 0, otherwise G is disconnected, nor degree{Xi) = 1, otherwise v 
is a cut vertex. Hence, degree{Xi) ^ 2. But if degree{Xi-i) = 2 and degree{Xi) ^ 2, with degree{v) = 3, then 
it must be that degree {Xi) = 3. □ 


Lemma 22 . Let G = {V,E) be a simple undirected graph, where every vertex has degree ^ 3 and where one 
vertex of degree - 3 is not a cut vertex. Let {G, C) be a linear reassembling and {G, p) a linear arrangement. 
Conclusion: If{G, C) is induced by {G, p), or if{G, p) is induced by {G, C), then: 

• a{G,£) = a{G,p). 

• {G, C) is a-optimal iff{G, p) is a-optimal. 


Proof. Straightforward consequence of Lemma [2T| the definitions of a{G,C) and a{G, p), and what it means 
for {G,C) to be induced by {G,p) and for {G,p) to be induced by {G,C). When there is at least one vertex 
of degree = 3 which is not a cut vertex, and therefore at least two of them by the proof of Lemma we can 
ignore the degrees of singleton clusters in the computation of a(G, £). All details omitted. □ 


Theorem 23 . For the class of simple undirected graphs G where every vertex has degree ^ 3, the a-optimality 
of linear arrangements {G, p) is polynomial-time reducible to the a-optimality of linear reassemblings {G, C). 


More explicitly, a polynomial-time algorithm A, which returns an a-optimal linear reassembling (G, £) 
of a graph G where every vertex has degree ^ 3, can be used to return an a-optimal linear arrangement {G,p). 


Proof. Consider an arbitrary G where every vertex has degree ^ 3. If every vertex in G of degree = 3 is a 
cut vertex, we use the algorithm in Lemma 20 to compute an a-optimal linear arrangement {G,p) in time 
0{n^^). If there is a vertex in G of degree = 3 which is not a cut vertex, we first compute an a-optimal linear 
reassembling {G,£) and then return the linear arrangement (G, p) induced by (G, £). The desired conclusion 
follows from Lemma [22l □ 


Corollary 24 . For the class of all simple undirected graphs G, the computation of a-optimal linear reassem¬ 
blings {G,£) is an NP-hardproblem. 


Proof. If there is a polynomial-time algorithm A to compute, for an arbitrary simple undirected graph, an 
a-optimal linear reassembling, then the same algorithm A can be used to compute in polynomial-time an a- 
optimal linear reassembling (G, £) for a graph G where every vertex has degree ^ 3. By Theorem]^ A can 
be further adapted to compute an a-optimal linear arrangement (G, p) for such a graph G in polynomial time. 
But the latter problem (in the literature: the minimum-cutwidth linear arrangement problem) is known to be 
NP-hard EOllIIl. □ 
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Remark 25 . To the best of our knowledge, the complexity status of the minimum-cutwidth linear arrangement 
problem for /c-regular graphs for a fixed /c ^ 3 is an open problem. If it were known to be NP-hard, we would 
be able to simplify our proof of Theorem 23 and its corollary considerably. In particular, we would be able 
to eliminate Lemmas [T^ and 20 and the supporting Definition [T8| as well as simplify Lemmas 21 and [2^ by 
restricting them to /c-regular graphs. □ 


Theorem [2^ and Corollary [2^ together say the a-optimality of linear arrangements (G, (p) is polynomial¬ 
time reducible to the a-optimality of linear reassemblings (G, C). For completeness, we show the converse in 
the next theorem. 


Theorem 26 . For the class of simple undirected graphs G in general, the a-optiniality of linear reassemblings 
(G, C) is polynomial-time reducible to the a-optimality of linear arrangements (G, (p). 

Proof In Appendix]^ □ 


5 ^^-Optimization of Linear Reassembling Is NP-Hard 


We prove that the 15 -optimality of linear arrangements (in the literature: the minimum-cost linear arrangement 
problem or also the optimal linear arrangement problem) and the ( 5 -optiniality of linear reassemblings are 
reducible to each other in polynomial time. Towards this end, we prove an intermediate result, which is also of 
independent interest (Theorem which presupposes Definition [27]). 

Definition 27 (Anchored linear reassemblings). Let G = (L, i7) be a simple undirected graph and let w € V. 
Let (G,C) be a linear reassembling of G, whose longest chain of nested clusters of size ^ 2, as in (j^ in the 
opening paragraph of Section [23] is: 

X, ^X2 ^ ^ Xn-l = V 


and whose corresponding singleton clusters are {Tq, • • • j Xn-i}, as determined by Q in the opening paragraph 
of Section 2.3 We say (G, C) is a linear reassembling anchored at w €V iff there is a vertex w' &V such that: 

^0 = and degreeQ(w) ^ degreeQ(w'). 


Note that we require that the immediate sibling Y\ = {ru'} of the leaf node Yq = {m} satisfy the condition 
degree q(w) ^ degree q(w'). This implies that, given an arbitrary vertex w € V, we cannot anchor a linear 
reassembling at w unless we find another vertex w' € V such that degreeQ(w) ^ degreeQ(w') and then make 
{m} and {m'j sibling leaf-nodes. This is a technical restriction to simplify the statement of Lemma 
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□ 


Definition 28 (Anchored linear arrangements). Let G = (V,E) be a simple undirected graph and let w € V. 
Let (G, p) be a linear arrangement of G. We say (G, p) is a linear arrangement anchored at w €V iff there is 
a vertex w' such that: 


p(w) = 1, p{w') = 2, and degreeQ(w) ^ degreeQ(w’). 

Again, as in Definition the condition degreeQ(w) ^ degreeQ(w') is imposed in order to simplify the 
statement of Lemma[^ It is worth noting that, if we relax this condition and allow degree q(w) > degree q(w'), 
then the new arrangement p' which permutes the positions of w and w', i.e.: 


p'(w') = l, p'(w) = 2, and p'(v) = p(v) ^or: aWv - {w,w'}, 


®Thus, we cannot say that the linear reassembling ( 87 , 33 ), in Examples j 14] and 15 is anchored at vertex “1”, though we can say 
that ( 87 , 63 ) is anchored at vertex “2”. More generally, in the case of a star graph 8 k with fc ^ 3 leaves: There is no linear reassembling 
of 8 k anchored at the internal vertex of 8 k- 
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is such that /?(G', ^p') < P{G, (pY In words, if we allowed degreeQ{w) > degreeQ{w'), the linear arrangement 
{G, ip) would not be /^-optimalH □ 

Example 29, Consider the linear reassemblings (8^,83) and (8^,85) in Example [l4| {87,83) is anchored at 
vertex “ 2 ”, but cannot be anchored at vertex “ 1 ”, while {87,83) is anchored at vertex “ 2 ”, and can be anchored 
again at vertex “3”. Both {87,83) and {87,83) are a-optimal and, a fortiori, a-optimal for the class of all 
linear reassemblings of 87 anchored at vertex “2”. Moreover, I3{87,83) = 35 and I3{87,83) = 29, so that 
{87,83) is not /3-optimal for the class of all linear reassemblings of 87 anchored at vertex “ 2 ”, while {87,83) 
is /3-optimal for the same class. 

Consider now the linear arrangements (S'7, (/^s) and (Sy, </?5) induced by the linear reassemblings {87,83) 
and {87,83), respectively. <p3 and ip3 are given in Example [Ts] and Example [T^ Both {87, ip3) and {87, (P3) 
are anchored at vertex “2”. Moreover, {87, tp3) cannot be anchored at vertex “1” (the sibling leaf of “2” in (^ 3 ), 
while {87,ip3) can be anchored again at vertex “3” (the sibling leaf of “2” in ip3). 

{87,ip3) is neither a-optimal nor /3-optimal for the class of all linear arrangements of 87 anchored at 
“ 2 ”; hence, {87, (P3) is neither a-optimal nor /3-optimal for the super-class of all linear arrangements of 87. 
By contrast, {87,ip3) is both a-optimal and /3-optimal for the class of all linear arrangements of 87', hence, 
(S'r, (^ 5 ) is both a-optimal and /3-optimal for the sub-class of all linear arrangements of 87 anchored at “ 2 ”. □ 

Eet {G,C) be a linear reassembling anchored at vertex w € V. We say {G,C) is fd-optimal relative to 
anchor w iff: 

P{G,£) = min{p{G,G)\{G,£' ) is a linear reassembling anchored at rc }. 

Clearly, (G, E) is a ^-optimal linear reassembling, with no anchor restriction, iff: 

/3(G, C) = min { I 3 {G, C') | there is a vertex w and 

(G, is a linear reassembling /3-optimal relative to anchor w }. 

Similar obvious conditions apply to what it means for (G, p) to be a jd-optimal linear arrangement relative to 
anchor w. 

Lemma 30. Let G = {V, E) be a simple undirected graph and w €V. Let (G, C) be a linear reassembling of 
G anchored at w, and (G, <p) be a linear arrangement ofG anchored at w, such that: 

(G, ip) is induced by (G, C) or {G,C) is induced by (G, ip). 

Conclusion: (G, C) is (^-optimal relative to anchor w iff{G, ip) is (^-optimal relative to anchor w. 

Proof. Eet d = degree{w) ^ 1 and A = '£,{d^S^^^{v) \ v ^ V and v 4 ^ w}. Consider the case when arrange¬ 
ment (G, ip) is induced by reassembling {G,C). (We omit the case when reassembling {G,C) is induced by 
arrangement (G, ip), which is treated similarly.) Erom Definitionj^ 

/3(G, C) = d + A + Y^{ degree{Xi) |l^i<n-l} 

where Xi,..., X^-i are all the clusters of size ^ 2 in G. Erom Definitions [7] and [TOl 

/3(G, ip) = d+Y^{ degree{Xi) |l<i^n-l}. 

Hence, both j 5 {G,C) and l 3 {G,ip) are minimized when the same quantity Y,{degree{Xi) 11 ^ i ^ n - 1} is 
minimized. The desired conclusion follows. □ 

’a similar statement applies to the a-measure: If we allowed degreeQ^w) > degreeQ^w'), then the new arrangement p' would be 
such that a{G, ip') a{G, ip), but not necessarily a{G, ip') < a{G, ip). 
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Remark 31 . There is an obvious definition of anchored a-optimality, similar to that of anchored ^-optimality 
above. However, results for the latter do not necessarily have counterparts for the former. In particular, the con¬ 
clusion of Lemma 1^ does not hold for a-optimality. Specifically, there are simple counter-examples showing 
the existence of a simple undirected graph G{V, E) with a distinguished vertex w €V such that: 

• there is a linear reassembling {G,C) which is a-optimal relative to anchor w, 

• but the linear arrangement {G, p) induced by (G, C) is not a-optimal relative to anchor w. 

Such a counter-example is the linear reassembling and the linear arrangement it induces, in 

Example [29] both anchored at vertex “2”: the former is a-optimal for the class of all linear reassemblings of Sy 
anchored at “2”, the latter is not a-optimal for the class of all linear arrangements of 87 anchored at “2”. 

There is an examination, yet to be undertaken, of the relation between linear reassemblings (G, £) and 
linear arrangements {G,p) that are a-optimal relative to the same anchor, similar to our study of anchored 
/3-optimality below. This examination we do not pursue in this report. □ 

Theorem 32 . For the class of all simple undirected graphs G = {V, E), each with a distinguished vertex w €V, 
the two following problems are polynomial-time reducible to each other: 

• the (^-optimality of linear arrangements (G, p) anchored at w, 

• the (i-optimality of linear reassemblings {G,C) anchored at w. 


More explicitly, a polynomial-time algorithm A, which returns a linear reassembling {G,C) [resp. a linear 
arrangement (G, p)^ which is /3-optimal relative to anchor w can be used to return in polynomial time a linear 
arrangement (G, p) [resp. a linear reassembling (G, £)] which is /3-optimal relative to anchor w. 


Proof This is an immediate consequence of Lemma [30| 


□ 


Definition 33 {Auxiliary graphs). Let G = {V,E) be a. simple undirected graph, with \ V\ = n and \E \ = m. 
Lor every m e L we define what we call an auxiliary graph Gw = {Vw,Ew) as follows: 

• Vw ■=V aU where [/ is a fresh set of p = X] { degreeQ{v) | u e L } vertices. 

• Ew ■= E& Dw where Dw ■= { uw | u e [/} u { Wfvf | ui, tt2 e G and uit U2}- 

Thus, the subgraph of Gw induced by the set V is simply the original graph G, and the subgraph of Gw induced 
by the set U u {m} is the complete graph itTp+i over p - 1-1 vertices. 

Informally, Gw is constructed from G and the complete graph Kp+i by identifying vertex w &V with one 
of the vertices of Kp^i. In particular, m is a cut vertex of the auxiliary graph Gw- We call w, which is the 
common vertex of G and Kp+i, the distinguished vertex of Gw- LI 


Lemma 34 . If Gw = {Vw, Ew) is the auxiliary graph for vertex w € V, as constructed in Definition 33 then 


I Vw I ^ and I E,, 


^ {n^ -2n^ -2n)l2. 


Proof The number m of edges in G is bounded by {nf - re)/2. Hence, p = | u e H} ^ (n^ - re), 

implying that the total number of vertices p + re in Gw is ^ (re^ - n) + n = nf. The number of edges in Kp is 
(p^ -p)/2, and in Kp+i it is (p^ +p)/2, which is ^ ((re^ -n)"^ + (re^ - re))/2 = {nf - 2 n^ + 2re^ - re)/2. Hence, 
the total number of edges in Gw is rre - 1 - (p^ +p)/2 ^ {nf - 2 n^ + 3re^ - 2re)/2. □ 


Let £ be a linear binary tree over V where, as in (|^ in the opening paragraph of Section 2.3 the longest 
chain of nested clusters of size ^ 2 is: 


X, ^X2 ^ ^ Xn-l = V, 
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and let the corresponding singleton clusters be {Iqi • • • ,Yn-i} as determined by The linear tree L is 
uniquely determined by a sequence of vertices written in the form: 

■■■ Vn\ 

where Yq = {ui}, Yi = {^ 2 }, • • ■, = {vn}- We say [ui ■■■ u„] is the vertex sequence induced by C, and C, 

the linear reassembling (or the linear binary tree) induced by the vertex sequence [ui ■■■ Vn\- 

Similarly, if y?: 1/ {1,..., n} is a linear arrangement of V, then if is uniquely determined by a sequence 

of vertices in the same form: 

[Ul ■■■ Vn\ 

where (^“^(1) = = V 2 , ■ ■ ■, = Vn- We say [ui ■■■ Vn\ is the vertex sequence induced by ip, and 

If the linear arrangement induced by the vertex sequence [ui ■■■ Vn\. 

For the auxiliary graph G^, whether we deal with a linear reassembling (G^, C) or a linear arrangement 
{Gw, f), it is convenient to consider sequences of the following form, which interleaves vertices and cutwidths: 

(❖) S := [xi (ri,si) X 2 (r 2 ,S 2 ) . ^n+p-l (^n+p-1) ^’n+p-l) ^n+p] 

where {xi,..., Xn+p} = Vw = {ui,... u {m,... ,Up}, and for every l^i^n + p-l: 

ri := degreeQ{{xi,...,Xi}) and Si := degree p;;^^^{{xi,... ,Xi}). 

We say the sequence S in Q is the sequence of vertices and cutwidths induced by {Gw,G) or by {Gw,f), 
whichever of the two is the case. The measure /3 on 5 is: 

PiS) ■■= Y. (ri + Si). 

l^i^n+p-1 

Lemma 35 . Consider the sequence of vertices and cutwidths induced by {Gw,G) orby {Gw,f), as just defined. 

Conclusion: 

• For every l^i^n + p-1, U holds that + Sj = degreeQ^{{xi,... ,Xi}). 

• If the sequence in Q is induced by the linear arrangement {Gw, f), then 

l3{Gw,f) = f3{S) = Y {c + Si). 

l^i^n+p-1 


• If the sequence in Q is induced by the linear reassembling {Gw, G), then 

fj{Gw,C) = A + ^{S) = A + Y ic + si), 

l^i^n+p-1 

where A = X!{degreeQ^(u) \ v ^Vw and v xi). 

Proof. Straightforward consequence of the definitions. All details omitted. □ 

We say that the sequence S is scattered if the vertices of iTp+i do not occur consecutively, i.e., the vertices 
of iTp+i are interspersed with vertices of L - {m}. 

Lemma 36. Let Gw = {Vw,Ew) be the auxiliary graph for vertex w €V, as constructed in Definition Let 
S be the sequence of vertices and cutwidths, as in Q, induced by a ^-optimal linear reassembling {Gw,L) or 
by a jj-optimal linear arrangement {Gw,f). Conclusion: S is not scattered. 
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In words, in a /^-optimal linear reassembling (G^u, £) [or in a /3-optimal linear arrangement (G^, (/?), resp.] 
all the vertices of Kp+i are reassembled consecutively [or arranged consecutively, resp.] without intervening 
vertices from V - {w}. 

Proof. In Appendix]^ □ 

Consider again the sequence S of vertices and cutwidths in 0- Suppose S is not scattered. This means 
that the p+1 vertices of iTp+i occur consecutively in S. We say S is balanced iff one of two conditions holds: 

(1) {Xl,. . . ,Xn-l} =V - {w}, {Xn} = {w}, {Xn+l, ■ ■ ■ , Xn+p} = U, 

(2) {xi,...,Xp} =U, {Xp+i} = {ru}, {Xp+ 2 ,---,Xn+p} = v-{w}. 

In words, S is balanced if all the vertices of 1/ - {m} are on the same side (on the left in (1), or on the right in 
(2)) of the distinguished vertex w and all the vertices of U are on the other side (on the right in (1), or on the 
left in (2)) of w. Put differently still, S is balanced if all the vertices oiV - {tu} are together, all the vertices of 
U are together, and w is between the two sets of vertices. The following is a refinement of the preceding lemma 
and its proof is based on a similar argument. 

Lemma 37. Let Gw = {Vw,Ew) be the auxiliary graph for vertex w €V, as constructed in Definition Let 
S be the sequence of vertices and cutwidths, as in 0, induced by a ^-optimal linear reassembling (Gw,P) or 
by a jd-optimal linear arrangement (Gw,p). Conclusion: S is balanced. 

Proof. In Appendix]^ □ 

By the preceding lemma, if the sequence S in 0 is induced by a /3-optimal linear reassembling (G^, £), 
or by a /3-optimal linear arrangement {Gw,p), then S is balanced, either on the left or on the right. For the rest 
of the analysis below, we assume that S is balanced on the right, i.e., all the vertices in U occur first, then w, 
and then all the vertices of 1/ - {tt;}. 

Definition 38 (Restrictions of linear reassemblings and linear arrangements). Let £ be a linear binary tree 
over the set V. If V £ V, the restriction of £ to V', denoted (£ | V'), consists of the following clusters: 

(£|L') := {XnV'\X€C]. 

It is a straightforward exercise to show that (£ 11/') is a linear binary tree over V'. 

Let p ■ V ->■ {1,..., n} be a linear arrangement of V. The restriction of p to V', denoted ((/? | y'), is 
defined as follows. For every 1 ^ f ^ n' = | L' |, let: 

{^p\V'^{v) ■= i where ti = and y e {1 ,..., n} is 

the largest integer such that |{(/?“^(1 ),..., 

Again here, it is straightforward to show that ((^ | L') is a linear arrangement of V' such that: 

(^ip\V') ^(1), ... , [ip\V') ^{n') is a subsequence of ... , Lp~^{n). 

Moreover, if (G, £) is a linear reassembling [resp. (G, p) is a linear arrangement] of the simple undirected 
graph G = {V,E) and G' = {V',E') is the subgraph of G induced by V c V, then (G', (£ | L')) is a linear 
reassembling [resp. (G', (p | V')) is a linear arrangement] of G'. □ 

Lemma 39. Let Gw = {Vw, Ew) be the auxiliary graph for vertex w ^V, as constructed in Definition\3^ 

1 . If {Gw,E) is a j5-optimal linear reassembling of Gw with no anchor restriction, then (G, (£|y)) is a 
^-optimal linear reassembling relative to anchor w. 
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2. If is a ^-optimal linear arrangement of Gyj with no anchor restriction, then (G, {p \ 1/)) is a 

^-optimal linear arrangement relative to anchor w. 


Proof We prove part 1 only, the proof of part 2 is similar. By Lemma the sequence S induced by a (3- 
optimal linear reassembling (G^, £) is balanced. By our assumption preceding Definition 27 we take S to be 
balanced on the right, i.e., all the vertices in U occur first, then w, and then all the vertices of 1/ - {m}. There 
are no edges connecting vertices in U on the left to vertices in 1/ - {m} on the right, with w a cut vertex in the 
middle. The /3-optimality of (G^, £) implies the /3-optimality of the linear reassembling (G, (£ 117)) of the 
subgraph G = (17,77) of G^ = (14), E.uf). We omit all formal details. □ 


Theorem 40 . For the class of all simple undirected graphs G, the two following problems are polynomial-time 
reducible to each other: 

• the f3-optimality of linear arrangements (G, (p), 

• the f3-optimality of linear reassemblings {G,C). 


More explicitly, a polynomial-time algorithm A, which returns a /3-optimal linear reassembling (G,£) [resp. 
a /3-optimal linear arrangement (G, y?)] of an arbitrary graph G, can be used to return a /3-optimal linear 
arrangement (G, p) [resp. a /3-optimal linear reassembling (G,£)] in polynomial time. 

Proof We compute a /3-optimal linear reassembling (G^., Ci) [resp. a /3-optimal linear arrangement (G^., pi)^ 
of the auxiliary graph G^., one for each vertex Uj e 17 = {ui,..., Vn}- We next consider the linear reassembling 
(G, {Ci 117)) [resp. the linear arrangement (G, {pi \ 17))] which, by Lemmais a /3-optimal linear reassem¬ 
bling relative to anchor Vi [resp. a /3-optimal linear arrangement relative to anchor Vi], for every I ^ n. Let 
(G, Pi) be the linear arrangement induced by the linear reassembling (G, {Ci \ 17)) [resp. let (G, Ci) be the 
linear reassembling induced by the linear arrangement (G, {pi \ 17))]. By Lemma(G, pi) is a /3-optimal 
linear arrangement relative to anchor Vi [resp. (G, £*) is a /3-optimal linear reassembling relative to anchor Vi\, 
for every 1 ^ i ^ n. Among these n linear arrangements [resp. n linear reassemblings], we choose one such 
that /3(G, Pi) is minimized [resp. /3(G, Ci) is minimized]. □ 

Corollary 41 , For the class of all simple undirected graphs G, the computation of /3-optimal linear reassem¬ 
blings {G,C) is an NP-hardproblem. 


Proof This follows from the NP-hardness of the minimum-cost linear arrangement problem (also called the 
optimal linear arrangement problem in the literature) @. This problem is the same as what we call, in this 
report, the problem of computing a /3-optimal linear arrangement. □ 


Remark 42 . To the best of our knowledge, the complexity status of the minimum-cost linear arrangement 
problem (or optimal linear arrangement problem) for fc-regular graphs for a fixed /c ^ 3 is an open problem. 
If it were known to be NP-hard, we would be able to simplify our proof of Theorem 40 and its corollary 
considerably. □ 


6 Related and Future Work 


We mentioned several open problems from the literature, still unresolved to the best of our knowledge, in 
Remarks [25] [3T| and[^ If these open problems were solved, partially or optimally, they would permit various 
simplifications in our proofs. In particular, even though one of our reductions can be carried out in polynomial 
time by invoking an earlier result on cutwidths (Lemmas [T^ and [20)), its 0{n^^) complexity is prohibitive (see 
the proof of Theorem 23); this is the reduction that reduces the a-optimality of linear arrangements to the 
a-optimality of linear reassemblings. 


20 





Beyond open problems whose resolutions would simplify and/or strengthen some of this report’s results 
and their proofs, our wider research agenda is to tackle forms of graph reassembling other than linear - in 
particular, balanced reassembling and binary reassembling in general, both strict and non-strict, all alluded to 
in Sections [T] 1^ and|^ For each form of reassembling, both a-optimization and /^-optimization will have to 
be addressed; as suggested by the examination in this report, these two optimizations seem to call for different 
proof methods, despite their closely related definitions. 

We also need to study classes of graphs for which a-optimization and/or /3-optimization of their reassem¬ 
bling, in any of the forms mentioned above, can be carried out in low-degree polynomial times. Finally, there 
is the question of whether, by allowing approximate solutions, we can turn the NP-hardness of any of the 
preceding optimizations into polynomially-solvable optimizations. The literature on approximation algorithms 
dealing with graph layout problems is likely to be an important resource to draw from (among many other 
papers, the older |[Il|9l[T4l the more recent |4], and the survey ifT^ l. 
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A Appendix: Sequential Graph Reassembling 


Let ^ be the set of all the partitions of the set V = {ui,..., Vn} of vertices in the graph G = (V, E). There are 
two special partitions in 

and Poo-={V}. 

Given two partitions X,Y € we write X if X is finer than Y or, equivalently, Y is coarser than X, 
i.e., for every block A € X there is a block B €Y such that A B. We write X ^ y iff X '^Y and X tY. 
The relation “e” is a (non-strict) partial order on with a least element {the. finest partition Pq) and a largest 
element (the coarsest partition Poo). We need the following simple fact. 

Lemma 43 . In the poset of all partitions of n elements, a maximal chain (linearly ordered with cj is a 

sequence of n partitions, always starting with Pq and ending with Poo. 


Proof If Xi c X 2 c ■■■ c Xk is a maximal chain of partitions, necessarily with Xi - Pq and X^ = Poo because 
the chain is maximal, then Xi has n blocks, X 2 has n - 1 blocks, in general Xp has n- p + 1 blocks, and Xk 
has one block. The length k of the chain is therefore exactly n. □ 


Definition 44 (Sequential graph reassembling, i.e., according to an ordering of the edges). Let 0 be an ordering 
of the edges in E. We use 0 to select n partitions in tiP forming a maximal chain (linearly ordered with c), 
which starts with the finest partition Pq and ends with the coarsest partition Poo = {V}, say: 

Xi c X 2 c ■■■ c Xn where Xi = Pq and X^ = Poo, 


as we explain next. To define Xp^i from Xp, we associafe each Xp wifh a subsequence 0p of fhe inifial 
sequence 0i = 0, for every p ^ 1. The subsequence 0p keeps frack of all fhe edges fhat have not yet been 
reconnected. We obtain the next pair (Xp+i, 0p+i) from the preceding pair i^Xp, 0p) as follows: 

(1) Take the first edge e in the sequence 0p, i.e., let 0p = e 0^ for some 0^, with e = Ww for some v,w &V, 
and let A and B be the unique blocks in Xp containing v and w, respectively. 

(2) Merge the two blocks A and B to obtain Xp+\, i.e., let: 

{Xp-{A,B})vj{AvjB}. 

(3) Delete every edge e' whose two endpoints are in the new block ^ u P to obtain 0p+i, i.e., let: 

0p+i ■= Op I {e ^ E \ e = v' w' and w'} E A u P }. 


In words, we go from (Xp, 0p) to (2fp+i, 0p+i) by merging the two blocks A and P in Xp that are connected 
by the first edge e in 0p, and then removing from further consideration all edges whose endpoints are in A u P. 

We refer to the sequential reassembling of G according to the ordering 0 by writing (G, 0), the result of 
which is the chain of partitions T” = Xi c ■■■ c Xn, more succintly written also as X = Xi ■■■ X„. □ 


Remark 45 . In Definition]^ when we merge the two blocks A and P because the edge e has its two endpoints 
in A and P, not only do we reconnect the two halves of e, but we additionally reconnect every other edge e' 
whose two endpoints are also in A and P. Thus, in general, we may reconnect several edges simultaneously - 
all the edges between A and P in the original graph - rather than one at a time by strictly following the order 
specified by 0. The same happens with binary graph-reassembling (Definition^. □ 


Definition 44 describes the process of going from an ordering 0 of edges to a maximal chain X of parti¬ 
tions. If X = Xi ■■■ Xn is a maximal chain of partitions, we say that X is strict if, for every consecutive pair 
(Xp, Xp+i) with A,B€ Xp and Au B € Xp+i, where 1 ^ p < n, it is the case that dc{A, B) t 0. The result 
of a sequential reassembling (G, 0) is always a strict maximal chain X of partitions. We can also carry out the 
process in reverse, as asserted by the next lemma. 
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Lemma 46 (From a maximal chain X of partitions to an ordering 0 that induces it). Let G = {V,E) be 
a graph, and the set of all partitions of V, as in Definition 44 For every maximal chain of partitions 
X = Xi c ■■■ c Xn, with Xi ,..., Xn € if X is strict, then there is an ordering (not necessarily unique) 0 of 
E such that (G, 0) = X. 


Proof This is a consequence of Definition]^ Details omitted. 


□ 


We want to relate the two notions: sequential reassembling (G, 0) in Definition 44 and binary reassem¬ 
bling {G,B) 'm Definition]^ The discussion to follow uses the following facts. 


Lemma 47. Let B be a binary tree over V = {t>i,..., Vn}, given in the formulation of Definition^ 

1. If S = {Xi,..., X]f\ B is a maximal collection ofk ^ 2 pairwise disjoint sets in B, then S is a partition 
ofV. We call such a maximal collection S a cross-section of the binary tree B. 

2. If dX is the set of all cross-sections of B, then (.5^,e) is a proper sub-poset of the poset in 

Lemma^3\ with the same bottom element Pq and top element Poo- 

3. In the poset a maximal chain has exactly n entries, always starting with Pq and ending with Poo- 


Proof All three parts can be proved by induction on n ^ 1, using the same reasoning as in the proofs of 
Propositions]^ and ]^ All details omitted. □ 

In the preceding lemma, it is worth noting that the size of is fixed as a function of n, the so-called Bell 
number B{n), which counts the partitions of an n-element set and grows exponentially in nj^By contrast, the 
size of y is much smaller, depends on both n and the shape of the binary tree B, and can be as small as n (the 
case when P is a linear, i.e., a degenerate binary tree). 

Consider a sequential reassembling (G, 0) of the graph G = {V,E), the result of which is a maximal 
chain of n partitions X = Xi cl ■■■ c Xn, as in Definition ]^ where Xi = Pq and Xn = Poo■ Since every 
successive partition Xp+i in X is obtained from the previous Xp by merging two blocks in Xp, there is a natural 
way of organizing X in the form of a binary tree B, with n (not all) of the cross-sections of B being exactly 
{Xi,..., Xn}- Let binary{G, 0) denote the binary reassembling thus obtained. 

Consider next a binary reassembling (G, B) of the graph G = {V, E)- The set y of all cross-sections in 
B is uniquely defined. We want to extract from .y a maximal chain X of cross-sections/partitions, ordered by 
c, which, by Lemma ]^ will in turn induce an ordering of 0 of the edges. The problem here is that there are 
generally many such maximal chains X. We need therefore a method to canonically extract a unique maximal 
chain X from y and a unique edge-ordering 0 from X. We propose such a method in the next paragraph. 

We assume that the binary reassembling (G, B) is strict and that there is a fixed ordering of the vertices, 
say, vi < V 2 < ■■■ < Vn- The vertex ordering is extended to edges, and to sets of edges, as follows: 

• If e = vw is the edge joining vertices v and w, we assume v < w- 

• If e = vw and e' = v' w', then e < e' iff either v < v' or v = v' and w < w' - 

• If A c p, then canon(A) is the canonical ordering of A w.r.t. “<”, Le-, canon{A) = ei 62 ■■■ 

where A = {ei, 62 , - - -, e^} and ei < 62 < < e^. 

• If A and B are non-empty disjoint set of edges, with canon{A) = ei 62 ■■■ and canon{B) = fi /2 
then canon{A) < canon{B) iff ei < /i. 

If FF e P, then Bw is the subtree of B rooted at W (see Proposition]^. We write (G, Bw) for a partial binary 
reassembling of the graph G = {V,E), the result being the subgraph of G induced by W together with all the 

*There is no known simple expression for the exponential growth of B{n) as a function of n, though there are various ways of 
estimating tight lower bounds and tight upper bounds on its asymptotic growth CD. 
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edges in dciW) as dangling edges, i.e., edges with only one endpoint in W. We define a canonical ordering of 
all the edges already in place in the partial reassembling (G, Bw), denoted canon{G, By/), as follows: 


canon{G,Bw) ■- 


(the empty string) if 14^ is a singleton set, 

canon{G,BT) canon{G,Bu) canon{d{T,U)) ifW = T&U and, 

canon{G,BT) < canon{G,Bu), 

canon{G,Bu) canon{G,BT) canon{d{T,U)) if 14^ = T&U and, 

canon{G,Bu) < canon{G,B t)■ 


Because the binary reassembling (G,B) is strict, d(T,U) ^ 0 in the second and third cases above, which 
implies canon(d(T, U)) i=e.lfW = V, then {G,B) = {G,Bw) and canon{G,B) = canon{G,Bw). 

Proposition 48 (Relating sequential reassembling and binary reassembling). Let G = {V, E) be a simple undi¬ 
rected graph. We have the following facts: 

1. For every sequential reassembling (G, 0), there is a binary tree B over V such that: 
binary (G,0) = {G,B). 

2. For every strict binary reassembling (G, B), there is an ordering 0 ofE such that: 
canon(G,i3) = (G,0). 

3. For every strict binary reassembling (G, B), it holds that: binary(canon(G, 13)) = (G, B). 

Proof Parts 1 and 2 follow from the definitions and discussion that precede the proposition. All details omit¬ 
ted. Part 3 can be proved by structural induction on the subtrees By/ of B, where the induction hypothesis is 
binary{canon{G, By/)) = (G, By/). All details omitted again. □ 

It is possible to refine fhe nofion of “canonical ordering” on fhe sef of edges E, so fhaf fhe equably 
canon{binary{G, 0)) = (G, 0) holds which, logefher wifh fhe equably in pari 3 of fhe preceding proposilion, 
will mean lhal fhe funclions binary{) and canon{) are inverses of each olher. We omif Ibis refinemenl as if will 
go furlher afield from our main concerns. 
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B Appendix: Remaining Proofs for Sections and 

We supply the details of several long straightforward and/or highly technical proofs which we omitted in Sec¬ 
tions!^ and in order to facilitate the grasp of the different concepts and their mutual dependence. 


Proof of Theorem 26 Let G = {V,E)h& an arbitrary simple undirected graph, with \ V\ = n. It suffices to 


show that if (G, (p) is an a-optimal linear arrangement, then the linear reassembling (G, C) induced by (G, p) 
is also a-optimal, using Definition [TT] 

In the notation of Definition [TT| the clusters of C, of size ^ 2 are {Xi,..., Xn-i}- For the singleton clusters 
of C, we pose li_i := {y?“^(/)}, where 1 ^ i ^ n. From Definitionj^ 


a{G, ip) = max { degree{YQ), max { degree{Xj) | 1 ^ j ^ n - 1}}, 

a{G, C) = max I m.a,yi{degree{Yi) | 0 ^ y ^ n - 1}, max { degree {Xj) |l^y^n-l}}. 


By way of getting a contradiction, assume that (G, p) is a-optimal but that the induced (G, C) is not a-optimal. 
Hence, there is another linear reassembling (G, C) which is a-optimal such that a(G, C) < a(G, C). Using 
the same notation for both {G,C) and (G, £'), where every name related to the latter is decorated with a prime, 
the inequality a{G,C') < a{G,C) implies the inequality: 


max { max { degree{Y^) | 0 ^ y ^ n - 1}, max { degree{Xj) | 1 ^ y ^ n - 1}} 

< max { max { degree{Yi) | 0 ^ y ^ n - 1}, max { degree{Xj) | 1 ^ y ^ n - 1}}. 

But m.ax.{degree{Yl) 10 ^ y ^ n - 1} = max.{degree(Yi) | 0 ^ y ^ n - 1}, which implies two inequalities: 


(1) max { degree{Yi) | 1 ^ y ^ n - 1} < max { degree{Xj) | 1 ^ y ^ n - 1}, 

(2) max { degree{Xj) | 1 ^ y ^ n - 1} < max { degree{Xj) | 1 ^ y ^ n - 1}. 

Hence, by inequality ([!]), we have: 


a(G,p) = a{G,C) = max{<iegree(Xj) | 1 ^ y ^ n - 1}. 


Consider now the linear arrangement (G, p') induced by the linear reassembling (G, £'), using Definition 10 
We have: 


a{G,p') = max degree (Yq), max { degree (Xj) \ l^y ^n-1}}. 

If degree^Y^) ^ max{degree{Xj)\ 1 ^ y ^ n - 1}, then inequality Q implies a{G,p') < a{G,p), else 
inequality ([^ implies again q;(G, p') < a(G, p). In both cases, the a-optimality of (G, p) is contradicted. □ 


For the proofs of Lemma and Lemma [37j we take a closer look at how the vertices of iFp+i are po¬ 
sitioned in the sequence S in Q in Section From the fact that p is the sum of all the vertex degrees in G, 
it follows that p is even and p -i- 1 odd. From the sequence S, we can extract the subsequence (S | Ffp+i) 
consisting of all the vertices of iLp+i and corresponding cut widths: 

(❖❖) {S\Kp+i) = [xq sq Xi^ Si^ . Xi^ Si^ 

where {ii,... ,ip+i} £ {1,...,n -i-p} and {x*^,... ,Xi^^^} = {u\,... ,Up} u {m}. In the preceding sequence, 
every vertex has the same degree p in the subgraph Kp+i. In the full graph G^, every vertex from Ffp+i has 
again the same degree p, except for the distinguished vertex w which has degree p + d where d = degree q{w). 
In particular, we have: 

Sh=P, Si2=2-(p-l), Si3 = 3-(p-2), ... , = (p-1 ) • 2, Sip=p. 
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The mid-point of (5 | Kp+i) is The two adjacent cutwidths of the mid-point are: 

'5*(p/ 2) = I ■ (| + 1) Si(p/2)+l = (| + 1) ■ 

SO that also, as one can readily check: 

_ p^ + 2p _ 

\pI2) ~ '®*(p/2)+i ~ ^ ~ inax{sjj, Sj2) • • •) %}) 


and the sequence of cutwidths (sjj,..., Sj^) is equal to its own reverse (si^,..., Si^). Moreover, for every j 
such that 1 ^ y < ii or ip+i < j ^ n + p, we have Sj = 0. Also, it is intuitively useful for the argument in the 
proof of Lemmato keep in mind that: 


{si^ — Sj) = 

P, 

for every 

1 < h, 

{si2 - Sj) = 

p-2, 

for every 

h^j< *2, 

(Sjg - Sj) = 

1 

: 

for every 

*2 ^ j < *3, 

(^i(p/2)-^j) = 

2, 

for every 

*(p/2)-l ^ j ^ ^p/2) 

(^i(p/2)+l ~ ~ 

0, 

for every 

Hp/2) ^ j < *(p/2)+l 


Proof of Lemma |36l In the sequence S in Q in Section]^ suppose: 

• Xi is the leftmost vertex in 17 u {m}, 

• Xj is the leftmost vertex inV - {m} to the right of Xi, 

• xi is the rightmost vertex in 1/ u {m}, 

• Xk is the rightmost vertex in L - {tu} to the left of Xi, 

where l^i^j^k^i^p + n. Graphically, S can be represented by: 


xi 


Xi—\ Xi 


Xj-I 


Xj+I 


^k-1 


^k+1 


Xp-\.n 


all inV - {if;} aH in f/ u {k;} 


all inU u {it;} 


all in V - {in} 


The circled vertices, Xj and Xk, are in L - {m}. If S is scattered, then 1 ^ i < j and/or k < i ^ n + p, with 
the possibility that j = kin which case there is only one vertex in L - {?/;} inserted between all the vertices of 
U u {m}. We define: 


scatter{S) '■= min{j-f, l-k} ^ 1, 


when S is scattered. If S is not scattered, we set scatter{S) := 0, so that S is scattered iff scatter{S) ^ 1. 
Moreover, with p even and p -i-1 odd, it is always the case that scatter{S) ^ p/2, so that if S is scattered, then: 

1 ^ sc alter {S) ^ 

To complete the proof, it suffices to show that if S is scattered, then we can define another sequence S' from S 
such that: 


I3{S') < (3{S) and scatter{S') < scatter{S). 
We obtain S' from S as follows: 
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• if j - i ^ - A:, remove Xj from the j-th position and insert it between Xi-i and Xi, 

• j - i > I - k, remove Xk from the A-th position and insert it between xi and x^+i- 
With no loss of generality, let j - i I - k. The portion of S under eonsideration is therefore: 

(ri_i,Si_i) Xi {ri,Si) ••• {rj-2,Sj-2) Xj-i {rj,Sj) 

^ V 

all in [/ u {lo} 

and the order of all the vertices in the new S' is: 

Xi "• Xi-i ' ^k-l ^k+l ^i+1 ^p+n 

^ >v* ^ ^ ^ V ^ ^ 

all in y - {in} all in f/ u {in} all in f/ u {in} all inV - {in} 

Let Xj = V € V - {ru} and partition d - degreeQ{v) into d= + d^, where: 

• d^ is the number of vertiees in {xi,..., Xj-i} n V whieh are eonneeted to v, 

• d^ is the number of vertiees in {xj+i,..., Xn+p} n V whieh are eonneeted to v. 

There are different eases, depending on: 

• the value of y - { between 1 and p/2, 

• the value of d^ - d^ between -d and +d, 

• whether the distinguished vertex w is in {xi ,..., Xj-i} or in {xj+i ,..., }, 

• whether v is eonneeted to w or not. 

We eonsider only one of the eases, which is also a “worst case” to explain, and leave to the reader all the other 
cases, which are simple variations of this “worst case”. For the “worst case” which we choose to consider, let: 

(1) j = i+ p/2 so that j - i = p/2, 

(2) = 0 so that - d^ = d, 

(3) misin{xj+i,...,X£}, 

(4) there is an edge Ww connecting v and w. 

With assumptions (1) to (4), as well as after: 

• substituting i + (p/2) for j, 

• replacing the sequence of cutwidths Si- 1 , Sj, ... , Si+(p/ 2 )-i, Si+(p/ 2 ), 'Si+(p/ 2 )+i 

by their actual values 0, p, ... , (p^ + 2p - 8)/4, (p^ + 2p)/4, (p^ + 2p)/4, respectively, 

• and posing r := r^-i, 

the portion of S under consideration becomes: 

(r,0) Xi (r,p) ••• (r,(p^+ 2p-8)/4) Xi+(p/ 2 ) (r, (p^ + 2p)/4) Q (r + d, (p^ + 2p)/4) 

all inU u {ifi} 

The corresponding portion in the new S' is: 

(r,0) Q (r + d,0) Xi {r + d,p) ■■■ (r + d, (p^ + 2p - 8)/4) Xi^p/ 2 ) (r + d, (p^ + 2p)/4) 

V 

all inU u {tii} 
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with all the cutwidths to the left and to the right of the shown portion being identical in S and S'. It is now 
readily seen that the value of /3(5') is: 

/3(5') = /3(5)-(p2 + 2p)/4 + (p/2)d = /3(5) - 

Let A := X!{ degreeQ{x) \ x ^V}. By the construction of the auxiliary graph Gw, we have p = A. The value 
of d = degreeQ{v) ^ A/2, the upper bound A/2 being the extreme case when G is a star graph with: v at its 
center, A/2 leaf vertices among {xj+i ,..., Xn+p} ri V, and all other vertices of V being isolated. Hence, 

p‘^-2{d-l)p ^ A2-2(^-1)A = A‘^-A‘^ + 2A = 2A. 

Hence, /3{S') ^ /3{S) - A/2, so that f3{S') < 13{S) which is the desired conclusion. □ 

For precision in the next proof, we introduce the measure of unbalance. 

Definition 49 {Unbalance). Consider the sequence S in Q in Section 

• Let ^ 0 be the number of vertices from C - {m} to the left of w, and ^ 0 be the number of vertices 
from 1/ - {m} to the right of w. 

• Let 6^ ^ 0 be the number of vertices from U to the left of w, and 6^ ^ 0 be the number of vertices from U 
to the right of w. 

The unbalance of S is measured by: 

unbal{S) := min { (n - - 1) + (p - 6^), {n--1) + {p-h^)}. 

The quantity (n - - 1) + (p - h^) measures 5’s unbalance on the left, and similarly (n - - 1) + (p - b^) 

measures 5’s unbalance on the right. It is useful to keep in mind that: 

{p-b^) + {p-b^) = p and (n - - 1) + (n - - 1) = n - 1, 

so that, if the quantity (n - - 1) + (p - b^) or the quantity (n - - 1) + (p - b^) is reduced to 0, then the 

other of these two quantities is increased to n - 1 + p. □ 


Proof of Lemma By Lemma we can assume that S is not scattered. It suffices to show that if 
unbal{S) ^ 1, we can define another sequence S' from S such that /3{S') < f3{S) and unbal{S') < unbal{S). 
We use the notation in the proof of LemmaThe portion of S that we examine closely is: 


(rj+ijSj+i) ■■■ , Sj+p_i) Xi^.p (rj+p, Sj.|.p) 


all inU u {m} 


where: 


V-{w} = {xl,...,xi-l}l^{xi+p+l,...,xn+p}, withl^i^n, 

U u {tu} = {xi,..., Xi+p}, with w = Xi+k and 0 ^p. 

We partition d = degreeQ{xi+k) = degreeQ{w) into d = d^ + d^, where: 

• ^ 0 is the number of vertices in {xi,..., Xj_i} which are connected to m = Xi+k in G, 

• d^ ^ 0 is the number of vertices in {xi+p+i, • • •, Xn+p} which are connected to tu = Xj+fe in G. 
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And we partition e = degreej^^^^{xi+k) = degree into e = = p, where: 


• = fc is the number of vertices in {xj,..., Xi+k-i} which are connected to m = Xi+k in Kp+i, 

• = {p- k) is the number of vertices in {xj+fc+i,..., Xj+p} which are connected to m = Xi+k in Kp+i. 


Though not explicitly used below, it is worth noting that and here are the same as and in Defi- 
because Kp+i is a complete graph (but and d^ are not the same as and a^). We consider 5 


nition 
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separate cases, {(a), (b), (c), (d), (e)}: 
(a) k = 0, which implies = 0 and 


■ p. 


In case (a), because unbal{S) ^ 0 by hypothesis, it must be that {xj+p+i,..., x^+p} ^ 0. It suffices to move the 
vertices in {xj+p+i,..., x„,+p} to the left of m = Xj, also preserving their order 


Xi, , Xi-\^ 

Using a reasoning similar to that in the proof of Lemmawe leave it to the reader to show that unbal{S') = 0 
and /3{S') < f3{S) for the resulting sequence S'. 

(b) k = p, which implies = p and = 0. 

Case (b) is similar to case (a). Because unbal{S) ^ 0 by hypothesis, it must be that {xi,..., Xj_i} ^ 0. In this 
case, we move the vertices in {xi,..., Xj_i} to the right of tu = Xj+p. Again, we leave it to the reader to show 
that unbal{S') = 0 and /3{S') < /3{S) for the resulting sequence S'. 

For the three remaining cases, we can assume that neither k - 0 nor k = p, i.e., both m ^ x* and w ^ Xj+p. 
Two cases of these three are: 

(c) d^ > d^, in which case we tranpose w = Xi+k and x^. 

(d) d^ < d^, in which case we transpose w = Xi+k and Xi+p. 

By a reasoning similar to that in the proof of Lemma we leave to the reader the straightforward details 
showing that P{S') < /3{S) in both case (c) and case (d). 

(e) d^ = d^, in which case the value of P{S) remains unchanged by tranposing w = Xi+k and Xi, or by 
transposing w = Xi+k and Xj+p, and so we need an additional argument. 

The additional argument for case (e), is to first transpose w = Xi+k and Xi, or alternatively transpose w = Xi+k 
and Xi+p, thus reducing case (e) to case (a), or alternatively reducing case (e) to case (b). □ 
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